15:50
2026-06-03
research.nvidia.com
machine-learning
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention
Researchers introduced Gated DeltaNet-2, a linear attention model that decouples the erase and write operations in recurrent state updates using separate channel-wise gates. The model outperforms Mamb…